Identification and initial characterization of Hfq-associated sRNAs in Histophilus somni strain 2336

Small RNAs (sRNA), in association with the global chaperone regulator Hfq, positively or negatively regulate gene expression in bacteria. For this study, Histophilus somni sRNAs that bind to Hfq were identified and then partially characterized. The Hfq-associated sRNAs in H. somni were isolated and identified by co-immunoprecipitation using anti-Hfq antibody, followed by sRNA sequencing. Sequence analysis of the sRNA samples identified 100 putative sRNAs, out of which 16 were present in pathogenic strain 2336, but not in non-pathogenic strain 129Pt. Bioinformatic analyses suggested that the sRNAs HS9, HS79, and HS97 could bind to many genes putatively involved in virulence/biofilm formation. Furthermore, multi-sequence alignment of the sRNA regions in the genome revealed that HS9 and HS97 could interact with sigma 54, which is a transcription factor linked to important bacterial traits, including motility, virulence, and biofilm formation. Northern blotting was used to determine the approximate size, abundance and any processing events attributed to the sRNAs. Selected sRNA candidates were confirmed to bind Hfq, as determined by electrophoretic mobility shift assays using sRNAs synthesized by in vitro transcription and recombinant Hfq. The exact transcriptional start site of the sRNA candidates was determined by RNA ligase-mediated rapid amplification of cDNA ends, followed by cloning and sequencing. This is the first investigation of H. somni sRNAs that show they may have important regulatory roles in virulence and biofilm formation.


Introduction
Histophilus somni is one of the major etiologic agents responsible for bovine respiratory disease (BRD) complex, and is also responsible for multiple systemic diseases, including septicemia, thrombotic meningoencephalitis, myocarditis, arthritis, infertility, and abortion [1][2][3][4][5]. BRD is the result of complex interactions between H. somni, the environment, host, and other bacterial and viral pathogens [6]. Genome sequences, complete or partial, have been obtained for over 65 H. somni strains (NCBI, GenBank), which have enabled whole genome a1111111111 a1111111111 a1111111111 a1111111111 a1111111111 isolated using the MasterPure™ DNA purification kit (Biosearch Technologies, Middleton, WI) and Zyppy plasmid miniprep kit (Zymo Research, Irvine, CA), respectively. The plasmids were introduced into E. coli by heat shock [28]. Phusion 1 High-Fidelity DNA Polymerase (New England Biolabs (NEB), Ipswich, MA) was used for polymerase chain reaction (PCR) amplification of DNA fragments. The DNA fragments were eluted and purified from the gel using the Zymogen gel DNA recovery kit (Zymo Research). Restriction and DNA modifying enzymes were purchased from NEB. The nucleotide sequences of recombinant strains were confirmed by sequencing (Psomagen, Inc., Brooklyn, NY).

Overexpression and purification of hexahistidyl-tagged Hfq (His6-Hfq)
For the over-expression of His 6 -Hfq, the complete coding region of Hfq (291 bp) of H. somni 2336 was amplified by PCR using the primer pair Hfq_overex_F/ Hfq_overex_R, and subcloned into the His-tag expression vector pET101/DTopo yielding pET-hfq. The recombinant plasmid was introduced into E. coli One Shot™ BL21 Star™ (DE3) cells (Invitrogen, Carlsbad, CA) by transformation. The overexpression of Hfq was induced by adding 1 mM isopropyl β-D-1-thiogalactopyranoside (IPTG) at an OD 600 of 0.6, and incubation of the culture was continued for 5 h at 37˚C. Induced cells were harvested by centrifugation (5000 x g for 15 min)

Production of antibodies to Hfq
Three young adult female Sprague Dawley rats were injected subcutaneously with an emulsion containing 30 μg of purified recombinant Hfq mixed with 120 μl of PBS and 150 μl of Freund's complete adjuvant (Thermo Fisher Scientific). Before the injection, 300 μl of peripheral blood was collected from the tail vein to obtain pre-immune serum while the animals were restrained in a restraining tube. Three weeks later, 300 μl of blood was collected to determine the serum antibody titer, followed by a booster injection containing 30 μg of purified Hfq in Freund's incomplete adjuvant (Thermo Fisher Scientific), as described above. The animals were boosted a third time three weeks later with 100 μg of purified Hfq in TiterMax Gold adjuvant (Sigma Aldrich, St. Louis, MO). Three weeks after the third immunization, blood was collected from the rats, and an antibody titer of >1:10,000 to purified Hfq was confirmed by enzyme-linked immunosorbent assay (ELISA), as described [29]. A fourth booster injection of 300 μg of purified Hfq in PBS was given intravenously at this time. Three days after the final booster, the rats were euthanized with CO 2 and exsanguinated by cardiac puncture. The blood was allowed to clot, and the sera stored at -20˚C. All procedures on animals were approved by the Long Island University Institutional Animal Care and Use Committee under protocol ID number 2021-001, February 17, 2021.

Isolation of Hfq-associated sRNA
Hfq-associated sRNAs were co-immunoprecipitated, as previously described [21]. Anti-Hfq IgG was isolated from the serum using the ImmunoPure 1 Melon TM gel IgG spin purification kit (Thermo Fisher Scientific), according to the manufacturer's instructions. After dialysis against coupling buffer, the purified IgG was coupled to PIERCE TM NHS-activated magnetic beads (Thermo Fisher Scientific), as recommended by the manufacturer, and stored at 4˚C until used. Hfq and associated sRNAs were co-precipitated from cell extracts of H. somni after growing the bacteria, as described above, to stationary phase. Following centrifugation (5000 x g for 15 min) the bacteria were washed in binding/washing buffer [50 mM Tris-HCl (pH 7.5), 150 mM NaCl, 0.05% Tween 20, and protease inhibitor (Thermo Fisher Scientific)] followed by sonication to lyse the bacteria (20% Amp, 5 sec bursts with 30 sec intervals on ice for a total of 30 min). The unbroken cells were removed by centrifugation at 14,500 X g for 45 min, and the supernatant was collected. The co-immunoprecipitation of Hfq and associated sRNAs were carried out according to the protocol given for PIERCE TM NHS-activated magnetic beads (Thermo Fisher Scientific). The Hfq-associated sRNAs were separated from Hfq using TRIzol TM LS reagent (Invitrogen) according to the manufacturer's protocol. To control for nonspecific binding of sRNAs to the magnetic beads, the immunoprecipitation process was carried out with H. somni cell extract and magnetic beads not coupled to anti-Hfq IgG.

RNA extraction
Total RNA, including microRNA, were extracted from H. somni cells using the Qiagen RNeasy1 Mini Kit (Qiagen) and mirVana TM miRNA isolation kit (Life Technologies, Carlsbad, CA), respectively, following the manufacturer's protocol. The quality and quantity of the RNAs were evaluated by NanoDrop (Thermo Fisher Scientific) and gel electrophoresis.

RNA sequencing and data analysis
The RNA samples were processed for cDNA library construction and sequencing at the Center for Genomics and Bioinformatics, Indiana University, Bloomington, IN. The cDNA libraries were prepared using the TruSeq small RNA preparation kit (Illumina Baltimore, Baltimore, MD) and the sequencing was performed using Illumina NextSeq 75 cycles High Output kit (Illumina) following the manufacturer's protocol. Reads were adapter-trimmed and quality filtered using Trimmomatic 0.38. An average base quality score of 20 over a window of 3 bases was set as the cutoff threshold. Cleaned reads were mapped to the H. somni 2336 reference genome sequence retrieved from NCBI (NC_010519.1) using Geneious Prime 1 2021.0.3 (http://www.geneious.com). Geneious Prime1 2021.0.3 was set to the default parameters to generated the read coverage and read counts per gene. All intergenic regions of >30 bp were considered for read count analysis. The promoter and rho-independent terminators of the intergenic regions were predicted by BPROM (Softberry Inc., Mt. Kisco, NY) and ARNold [30], respectively. The criteria used for the prediction of promoters and terminators is detailed in S1 File. sRNA secondary structures were predicted by RNAfold webserver with default parameters [31]. The complete RNA sequence for sample GSF2751-sRNA Hfq-8 is accessible through NCBI BioProject under accession number PRJNA845786 (https://www.ncbi.nlm.nih. gov/bioproject/PRJNA845786). GSF2751-sRNA Hfq-8 is one of six repetitive samples from the same strain.

In vitro RNA transcription
The DNA sequences of the sRNAs HS9, HS79, and HS97 were amplified from H. somni genomic DNA by PCR using forward primers harboring a T7 promoter sequence at the 5'-end. The PCR products were used as template for the synthesis of sRNAs HS9, HS79, and HS97 using MAXIscript 1 Kit (Invitrogen), according to the manufacturer's instructions. The transcribed product was treated with DNAse to remove the DNA template, followed by purification of the RNA using RNeasy1 MinElute1 Cleanup Kit (Qiagen) according to the manufacturer's protocol. The specificity and quality of the transcribed RNAs were confirmed by TBE (Tris/ Borate/Ethylenediaminetetraacetic acid)-Urea gel (15% acrylamide) electrophoresis.

Northern blotting
Based on the sequence similarity to currently known sRNAs that affect virulence in other Gram-negative bacteria, three of the sRNA candidates (HS9, HS79, and HS97) were selected for further studies. Northern blotting of the sRNA candidates was conducted using North2-South 1 Chemiluminescent Hybridization and Detection Kit (Thermo Fischer Scientific) according to the manufacturer's protocol. Briefly, the RNA isolated from H. somni cells was loaded onto a TBE-Urea gel (15%) and run at a constant voltage of 180 V for 85 min until the bromophenol blue tracking dye reached the bottom of the gel. As a size marker, a low range ssRNA ladder (New England Biolabs, MA, USA) was also loaded. Following electrophoresis, the RNA gels were rinsed in five gel volumes of 0.5x TBE buffer for 30 min. RNA was then transferred onto nylon membranes (Biodyne TM Precut Nylon Membranes, Thermo Fisher Scientific) at 20 V for 90 min, using Trans-Blot SD Semi-Dry Transfer Cell (Biorad, Hercules, CA). The RNA was then cross-linked to the membrane by baking at 80˚C for 15 min before the hybridization process. Pre-hybridization and hybridization procedures were carried out according to the manufacturer's protocol. For hybridization, denatured (boiled) 5'-biotinylated probes (26 bp in length; 30 ng per ml) were added to the hybridization buffer and incubated with the membranes overnight with shaking at 42˚C. The membranes were washed and developed according to the manufacturer's protocol, followed by imaging with ChemiDoc Imaging System (Biorad).

RNA ligase-mediated (RLM)-rapid amplification of cDNA ends (RACE)
RLM-RACE was performed using FirstChoice 1 RLM-RACE Kit (Thermo Fisher Scientific) according to the manufacturer's protocol. The PCR products were visualized by agarose gel electrophoresis, purified, cloned into pUC19, and the recombinant clones were sequenced to identify transcriptional start sites.

Electrophoretic mobility shift assay (EMSA)
EMSA was carried out to confirm the interaction of the three sRNAs selected (HS9, HS79, and HS97) with recombinant Hfq. Two hundred ng of in vitro transcribed sRNAs and recombinant Hfq (varying concentrations of 0.65 μM-5 μM) were added to 15 μl of reaction mixture buffer (consisting of 10 mM Tris-HCl, pH 7.4, 6% glycerol, 50 μg/ml BSA, and 0.75 μg Baker's yeast tRNA), and incubated at room temperature for 30 min. The reaction mixtures were electrophoresed on a pre-run (100 V for 20 min) Novex TM retardation gel (6% native polyacrylamide gel) (Thermo Fisher Scientific) in 0.5x TBE buffer at 100 V for 90 min followed by staining with GelGreen 1 nucleic acid gel stain (Biotium, Fremont, CA) for visualization.

Enrichment of Hfq-associated transcripts by co-immunoprecipitation
H. somni 2336 cells were grown in CTT broth to early stationary phase because Hfq expression has been shown to be upregulated in stationary phase cultures [32]. The H. somni cell extract was incubated with magnetic beads coupled with anti-Hfq IgG to bind Hfq and the associated sRNAs, as described in Materials and Methods. The successful recovery and quality of the extracted sRNAs were confirmed by RNA gel electrophoresis, followed by analysis using the Agilent 4200 TapeStation system (Agilent Technologies, Inc., Santa Clara, CA). cDNA libraries were constructed from 6 independent sRNA preparations that had the greatest RNA concentration and least salt contamination, followed by deep sequencing them individually using Illumina technology. The results of the RNA sequencing are presented in Table 2.

Mapping of identified RNA sequences
Illumina sequencing generated a combined total of 20 million reads from the cDNA libraries created from 6 sRNA samples. The size of the sequencing reads was adequate for mapping

Analysis of the intergenic regions of the H. somni genome
The aligned RNA-seq based transcriptome map was examined in comparison to the H. somni genome, and intergenic regions in the genome were identified. Promoters and terminators were predicted to confirm the identified transcripts using the online tools BPROM (Softberry Inc., Mt. Kisco, NY) and ARNold [30], respectively. A total of 100 sRNA candidates were identified at the intergenic regions ( Table 3), out of which 43 have been previously reported, and of which 16 are unique to strain 2336 [8]. Of the sRNAs identified in this study, three were housekeeping sRNAs (ssrS, ffs and rnpB), which are well characterized in other bacterial strains [33][34][35]. The majority of the identified sRNA candidates (60%) were shorter than 200 bases (size range 53-547 bases). Promoters and rho-independent terminators were predicted for 85 and 59 sRNAs, respectively. Three of the H. somni strain 2336 sRNAs that were selected for further study and analyzed by Geneious Prime 1 are shown in Fig 1 and S1 File.
The sequencing data were screened manually using Geneious Prime 1 . The sequences of the sRNA candidates were comparable with those previously reported for H. somni 2336 [8], including those sRNAs that regulate metabolism of lysine (HS119) and glycine (HS72). The sRNAs isrK (HS144), tmRNA (HS23), and gcvB (HS166) were also identified. isrK and tmRNA encoded by the ssrA gene were highly expressed, as determined by RNA sequencing. Multi-sequence alignment of the H. somni sRNA candidate HS166 (gcvB) showed 99% identity to that of the homologous sRNA in P. multocida (Fig 2A). The comparative sequence analysis of HS166 to gcvB of P. multocida and E. coli revealed two conserved sequences (R1 and R2), which have been reported [36] (Fig 2B). Functional characterization of the above-mentioned sRNAs is needed in order to determine the exact role of these regulatory elements in H. somni 2336.

Identification and characterization of sRNAs in H. somni 2336
The potential non-coding intergenic regions of the H. somni 2336 genome were identified using Geneious Prime TM . Each of the sRNA candidates were manually checked for their relative read coverage level in the sequencing data and their conservation across different bacterial strains based on previous data [8]. A total of 8 sRNA candidates (HS9, HS14, HS26, HS72, HS79, HS86, HS97, HS98) were considered for further characterization based on preliminary bioinformatic analyses. All the selected sRNAs showed relatively high read coverage in the sequencing data. In addition, the sRNAs that are conserved with phylogenetically closer bacterial genomes, especially members of Pasteurellaceae family (Mannheimia haemolytica, P. multocida, Haemophilus influenza, etc.), but also in distantly related bacterial species, were selected with the intention of performing a comparative study. Multi-sequence analysis indicated that many of these sRNA regions in the genome may interact with the sigma-54 transcription factor (encoded by rpoN), which is known to control bacterial quorum sensing and virulence [38]. In the case of sigma-54 binding motifs, 11 bases are highly conserved across bacterial species [39]. For the candidates HS14, HS26, and HS98, 9 out of 11 bases of the conserved consensus bases in the sigma 54 binding motif were identical [39] (Table 4). In the case of HS9, and HS72, 8 of the 11 bases matched the conserved consensus bases in the sigma 54

Northern blotting to verify selected sRNA candidates
The selected 8 sRNAs candidates were chosen based on preliminary bioinformatic analysis and the relatively higher read coverage in the sequencing data. Three of these sRNA candidates (HS9, HS79, and HS97) were further selected based on bioinformatic predictions of their interaction with known virulence or biofilm genes in H. somni, such as luxR (found two luxR candidates in H. somni), narQ, uspE, and uspA. Northern blotting was used to determine the approximate size and expression level of these sRNAs, and to determine if they make any splice products. Northern blots demonstrated two bands of~115 bases and~60 bases for sRNA HS9, three bands of~80 bases,~130 bases, and~300 bases for HS79, and two bands of~140 bases and~200 bases for HS97 (Fig 3). A BLAST analysis of the sequences of the sRNA-specific probes with the complete H. somni genome, and optimized for "somewhat similar sequences", resulted in only one hit for each of the three sRNAs (HS9, HS79, HS97). Therefore, it was unlikely that the probe bound to more than one sRNA sequence in the genome. Rather, the multiple bands for each sRNA in the Northern blots suggest multiple transcripts being synthesized from the same promoter or post-transcriptional splicing.

Determination of the transcriptional start site of selected sRNA candidates
Two sRNA candidates, HS79 and HS97, were further characterized by 5'-RLM-RACE. The advantage of RLM-RACE over conventional RACE is that the former only amplifies the primary transcripts, and not the processed products. The bands (indicated with an oval shape) in Fig 4A were excised, cloned, and sequenced to determine the transcriptional start sites. For sRNA HS79, the transcriptional start site identified matches well with one of the bands (~300 bases) observed by Northern blotting. For HS97, the position of the transcriptional start site matched the band with a size of~140 bases in the Northern blot. Further analysis of the promoter region of the sRNAs with the online software BPROM (Softberry Inc.) revealed additional transcriptional start sites upstream of the transcriptional start site determined by

PLOS ONE
RLM-RACE (Fig 4B and 4C). Schematic representation of the chromosomal regions and sequences of sRNAs HS79 and HS97 are shown in Fig 4B and 4C.

EMSA to confirm the interaction of selected sRNAs with recombinant Hfq
To confirm the binding of sRNAs with Hfq, recombinant Hfq was expressed, purified, and incubated with in vitro-transcribed sRNAs HS9, HS79, and HS97 for EMSA ( Fig 5). As shown, all three sRNAs specifically bound to the recombinant Hfq, and the intensity of the band shift in EMSA was proportional to the concentration of recombinant Hfq added.

Prediction of secondary structures of selected sRNAs
Once the 5' terminus of the sRNA transcripts were assigned by RLM-RACE and examined by Northern blotting and bioinformatics, the resulting sequences were further analyzed for prediction of their secondary structures by RNAfold [31]. The predicted secondary structures of The scale bar for the RNA sequence coverage (labelled as "coverage") depicts minimum and maximum values for the entire alignment, and the tick to its right (range 0-4508) represents the mean coverage. The consensus sequence (top-most line) is displayed above the alignment or assembly, and shows those residues that are conserved. The six-frame translation (three overlapping reading frames in the forward direction and the complementary strand in the reverse direction) of the consensus sequence is displayed just below the consensus sequence. The top, dark blue line represents the reference genome sequence, and the six black lines below it represent the six-frame translation of the reference sequence. The representative zoomed images of the six-frame translation of the reference sequence and consensus sequence is shown in S1 File. The orange lines represent the maximum coverage for the RNA sequence. The sRNA names are shown close to their corresponding sRNA transcripts in the graphs. For HS97, the first region that has the transcript mapping represents the sRNA. https://doi.org/10.1371/journal.pone.0286158.g001 the sRNAs HS79 and HS97 are shown in Fig 6. These sRNAs contained single stranded unpaired regions rich in adenosine and uridine, and these AU-rich regions are proposed to be binding sites for Hfq in other bacteria [40].

Discussion
Despite recent advancements in sequencing technologies for the identification of bacterial non-coding RNAs, the role of sRNAs as post-transcriptional regulators in H. somni is unknown. In 2012, Kumar et al. [8] identified a number of sRNAs in H. somni based on an RNA-Seq based transcriptome map. In that study, RNA was isolated from H. somni cells, enriched mRNAs, sequenced, reads were mapped to the H. somni genome sequence, and the  [36] are shown above the alignment. The seed region is labelled as SR and is underlined. The predicted H. somni gcvB rho-independent terminator is marked as T1. The predicted start site of P. multocida gcvB, as determined by 5' RACE [37], is indicated by the grey arrow. All the positions in each sequence that are identical with the consensus are given in upper-case, the other positions are in lower case. The residues that are highly conserved appear in high-consensus red color. The residues that are weakly conserved are shown in low consensus blue color. Other residues are shown in neutral color, which is black. A position with no conserved residue is represented by a dot in the consensus line. https://doi.org/10.1371/journal.pone.0286158.g002

PLOS ONE
intergenic regions were analyzed to identify potential sRNAs. A total of 94 sRNAs were identified in the H. somni genome, out of which some sRNAs were unique to H. somni strain 2336 compared to the non-virulent strain 129 Pt [8]. However, none of the predicted sRNAs in H. somni were validated or further characterized in the above study. In our current study, the sRNAs that bind to the global regulatory protein Hfq were isolated by co-immunoprecipitation with anti-Hfq antibody, followed by sequencing, and further bioinformatic and laboratory analysis to identify and partially characterize potential sRNA candidates in H. somni. Hfq is known to contribute to regulation of important bacterial traits that include quorum sensing, virulence, pathogenesis, biofilm formation etc. in many Gram-negative bacteria [41][42][43]. Thus, the identification and characterization of sRNAs associated with Hfq will be valuable in

PLOS ONE
regard to understanding post-transcriptional regulation of genes involved in virulence and pathogenesis in H. somni. Out of the 100 intergenic sRNA regions identified in this study, 43 have been previously reported [8]. Kumar et al. identified a number of sRNAs that were not found in our study [8], but other sRNAs were identified in our study that were not detected by Kumar et al. [8]. The differences between the sRNAs identified in our study with that of Kumar et al. [8] could be due to the difference in culture conditions, the RNA extraction method, the selected interaction with Hfq, or the RNA type used for sequencing.
Northern blotting was carried out to determine the approximate size, abundance, and splice products of any of the selected sRNAs. Multiple bands in Northern blots of sRNAs is an indication of more than one transcript being synthesized from the same promoter or post-transcriptional splicing. Hfq is structurally and sequence-wise closely related to the Sm/Lsm family of

PLOS ONE
proteins involved in splicing in eukaryotes. Thus, it has been suggested that Hfq may be functionally similar to eukaryotic Sm proteins [44]. EMSA experiments with selected sRNA candidates confirmed that these sRNAs specifically bound to Hfq, supporting that Hfq likely contributes a supporting role for these sRNAs in H. somni. Transcriptional start sites were determined for selected sRNAs, which further enabled the structural prediction of these sRNAs. The secondary structure (containing AU-rich regions and several stem loops) of the selected sRNAs supported that these sRNAs interacted with Hfq, and the potential role of these sRNAs in the control of various biological processes [40,45]. Information on rho-independent terminators could not be obtained for many of the sRNAs identified using any of the online prediction tools, and further analysis is needed to categorize them as either novel sRNAs or novel genes.
A number of sRNAs, including isrK, tmRNA, gcvB and those that regulate metabolism of lysine and glycine were identified in this study and these results correlate with previous results on H. somni sRNAs [8]. Most of these sRNAs are known to play important regulatory roles in virulence, pathogenesis, acid resistance, thermotolerance, and osmotic stress response in many Gram-negative bacteria [46][47][48][49][50][51]. In S. enterica serovar typhimurium, isrK is expressed in the stationary phase under conditions known to regulate invasion, including low pH, low magnesium levels, and low oxygen [46]. tmRNA contributes to virulence and pathogenesis in many bacterial pathogens including Legionella pneumophilia, S. enterica serovar typhimurium, Francisella tularensis, P. aeruginosa, and others [47][48][49][50]. GcvB is known to regulate a number of amino acid transport systems, as well as amino acid synthesis genes in a number of pathogenic bacteria, including Yersinia pestis, Haemophilus influenzae, V. cholerae, S. enterica serovar typhimurium, Klebsiella pneumoniae, and P. multocida [37,[52][53][54][55][56]. GcvB contributes to important regulatory roles in oxidative stress, biofilm formation, and lipid A and lipopolysaccharide (LPS) modifications in Gram-negative bacteria [57][58][59][60]. In P. multocida, another bovine respiratory pathogen that is closely related to H. somni, the overexpression of gcvB results in increased lag phase growth [37]. Further characterization of gcvB in H. somni is needed to understand the regulatory role of gcvB in the virulence of this respiratory pathogen.
Target predictions were made for the 8 sRNAs identified with high read coverage in the sequencing data using IntaRNA [45]. This analysis revealed a number of interesting gene candidates, including the LuxR family candidate HS_0042 (nitrate/nitrite response regulator), narQ (encoding a nitrite/nitrate sensor kinase), LuxR family candidate HS_0901 (regulatory protein, sigma-70 region 4 type 2), uspE (encoding a stress response regulator), and uspA (encoding a stress protein) (Fig 7). The majority of these target genes bound to the typical A/U rich regions found in small RNAs. sRNAs are known to interact with multiple target genes, and their interaction is through the internal A/U rich regions and the 3' U-rich region after the transcriptional terminator [61]. In addition, some sRNAs were predicted to interact with their target genes at the UG-rich regions and the role of UG-rich regions in binding to target ). uspE encodes a stress response regulator and is required for virulence; uspA encodes a nonspecified stress protein; luxR (HS_0901) encodes a regulatory protein, sigma-70 region 4 type 2; narQ encodes a nitrite/nitrate sensor kinase; luxR (HS_0042) encodes a nitrate/nitrite response regulator. https://doi.org/10.1371/journal.pone.0286158.g007

PLOS ONE
mRNAs has been reported [62]. The stress protein UspA plays a crucial role in the regulation of stress resistance and virulence in Salmonella spp. [63]. Our group has reported that UspE is a global regulator controlling virulence and biofilm formation in H. somni [64]. The LuxR family candidate HS_0901 (regulatory protein, sigma-70 region 4 type 2) and LuxR family candidate HS_0042 (nitrate/nitrite response regulator) are yet to be characterized in H. somni. Recently, Martín-Rodríguez et al. reported that NarQ, along with the nitrate response regulator NarP, plays a regulatory role in biofilm formation by uropathogenic E. coli [65]. Thus, the functional characterization of the nitrate response regulator HS_0042 (LuxR family candidate) may identify its role in the regulatory network of biofilm formation in H. somni. In addition, further studies on the sRNAs that are predicted to be associated with these important regulatory genes would facilitate future studies to develop new promising strategies to effectively prevent diseases due to H. somni.

Conclusions
In this study, a total of 100 H. somni strain 2336 putative sRNAs, including previously unknown sRNAs, were identified that are associated with the global regulatory protein Hfq under the growth conditions tested. Highly conserved noncoding sRNAs, including the housekeeping sRNAs ssrS, ffs, and rnpB, and tmRNA were also identified. Bioinformatic analysis revealed that many of the sRNA candidates could potentially interact with genes involved in quorum sensing, virulence, and biofilm formation in many bacterial pathogens. This is the first study in which Hfq-associated sRNAs were identified and characterized in H. somni. Further functional characterization of these Hfq-associated sRNAs may shed light into the posttranscriptional regulatory network, and the role of Hfq and sRNAs in the virulence and pathogenicity of H. somni. Inzana.